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Abstract. Leftist grammars [Motwani et al. , STOC 2000] are special semi-Thue 
systems where symbols can only insert or erase to their left. We develop a theory 
of leftist grammars seen as word transformers as a tool toward rigorous analyses 
of their computational power. Our main contributions in this first paper are (1) 
constructions proving that leftist transformations are closed under compositions 
and transitive closures, and (2) a proof that bounded reachability is NP-complete 
even for leftist grammars with acyclic rules. 



1 Introduction 

Leftist grammars were introduced by Motwani et al. to study accessibility and safety 
in protection systems [MPSV00]. In this framework, leftist grammars are used to show 
that restricted accessibility grammars have decidable accessibility problems (unlike the 
more general access-matrix model). 

Leftist grammars are both surprisingly simple and surprisingly complex. Simplicity 
comes from the fact that they only allow rules of the form "a — > ba" and "cd — > d" 
where a symbol inserts, resp. erases, another symbol to its left while remaining un- 
changed. But the combination of insertion and deletion rules makes leftist grammars 
go beyond context-sensitive grammars, and the decidability result comes with a high 
complexity-theoretical price [Jur08]. Most of all, what is surprising is that apparently 
leftist grammars had not been identified as a relevant computational formalism until 
2000. 

The known facts on leftist grammars and their computational and expressive power 
are rather scarce. Motwani et al. show that it is decidable whether a given word can 
be derived (accessibility) and whether all derivable words belong to a given regular 
language (safety) [MPSV00]. Jurdziriski and Lorys showed that leftist grammars can 
define languages that are not context-free [JL07] while leftist grammars restricted to 
acyclic rules are less expressive since they can only recognize regular languages. Then 
Jurdzinski showed a PS PACE lower bound for accessibility in leftist grammars [Jur07], 
before improving this to a nonprimitive-recursive lower bound [Jur08]. 

Jurdziriski's results rely on encoding classical computational structures (linear-boun- 
ded automata [Jur07] and Ackermann's function [Jur08]) in leftist grammars. Devising 
such encodings is difficult because leftist grammars are very hard to control. Thus, 
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for computing Ackermann's function, devising the encoding is actually not the hardest 
part: the harder task is to prove that the constructed leftist grammar cannot behave in 
unexpected ways. In this regard, the published proofs are necessarily incomplete, hard 
to follow, and hard to fully acknowledge. The final results and intermediary lemmas 
cannot easily be adapted or reused. 

Our Contribution. We develop a compositional theory of leftist grammars and leftist 
transformations (i.e., operations on strings that are computed by leftist grammars) that 
provides fundamental tools for the analysis of their computational power. Our main 
contributions are effective constructions for the composition and the transitive closure 
of leftist transformations. The correctness proofs for these constructions are based on 
new definitions (e.g., for greedy derivations) and associated lemmas. 

A first application of the compositional theory is given in Section 6 where we prove 
the N P-completeness of bounded reachability questions, even when restricted to acyclic 
leftist grammars. 

A second application, and the main reason for this paper, is our forthcoming con- 
struction proving that leftist grammars can simulate lossy channel systems and "com- 
pute" all multiply-recursive transformations and nothing more (based on [CS08b]), thus 
providing a precise measure of their computational power. Finally, after our introduction 
of Post's Embedding Problem [CS07,CS08a], leftist grammars are another basic com- 
putational model that will have been shown to capture exactly the notion of multiply- 
recursive computation. 

As further comparison with earlier work, we observe that, of course, the complex 
constructions in [Jur07,Jur08] are built modularly. However, the modularity is not made 
fully explicit in these works, the interfacing assumptions are incompletely stated, or are 
mixed with the details of the constructions, and correctness proofs cannot be given in 
full. 

Outline of the Paper. Basic notations and definitions are recalled in Section 2. Section 3 
defines leftist grammars and proves a generalized version of the completeness of greedy 
derivations. Sections 4 introduces leftist transformers and their sequential compositions. 
Section 5 specializes on the "simple" transformers that we use in Section 6 for our 
encoding of 3SAT. Finally Section 7 shows that so-called "anchored" transformers are 
closed under the transitive closure operation, this in an effective way. 

2 Basic Definitions and Notations 

Words. We use x,y, u, v, w, a, P, . . . to denote words, i.e., finite strings of symbols taken 
from some alphabet. Concatenation is denoted multiplicatively with e (the empty word) 
as neutral element, and the length of x is denoted \x\. 

The congruence on words generated by the equivalences a m aa (for all symbols a 
in the alphabet) is called the stuttering equivalence and is also denoted «: every word x 
has a minimal and canonical stuttering-equivalentx' obtained by repeatedly eliminating 
symbols in x that are adjacent to a copy of themselves. 

We say that x is a subword of y, denoted x C y, if x can be obtained by deleting some 
symbols (an arbitrary number, at arbitrary positions) from y. We further write iCjji 
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when all the symbols deleted from y belong to E (NB: we do not require y G E*), and 
let □ denote the inverse relation 

Relations and Relation Algebra. We see a relation R between two sets X and Y as a set 
of pairs, i.e., some R C X x Y. We write xRy rather than (x,y) £ R. Two relations and 

R' can be composed, denoted multiplicatively with R.R', and defined by x (R.R') y <5=£ 
3z.(xRz A zfl'y). 

The union R + R', also denoted RUR', is just the set-theoretic union. /?" is the n-th 
power R.R. . .RofR and R~ l is the inverse of R: x R l y4$yRx. The transitive closure 
Un=i 2 of assumes Y =X and is denoted R + , while its reflexive-transitive closure 
is R + Uldx, denoted/?*. 

Below we often use notations from relation algebra to state simple equivalences. 
E.g., we write "R — R 1 " and "R C 5" rather than "x,Ry iffx/?'y" and "xRy implies x Sy". 
Our proofs often rely on well-known basic laws from relation algebra, like (R.R')^ 1 = 
R'-i.R- 1 , or (R + R').R" = R.R" + R'.R", without explicitly stating them. 

3 Leftist Grammars 

A leftist grammar (an LGr) is a triple G = (L,P,g) where E U {g} — {a,b, . . .} is a 
finite alphabet, g $ E is a final symbol (also called "axiom"), and P = {r, . . .} is a set of 
production rules that may be insertion rules of the form a — > ba, and deletion rules of 
the form cd — > d. For simplicity, we forbid rules that insert or delete the axiom g (this 
is no loss of generality [JL07, Prop. 3]). 

Leftist grammars are not context-free (deletions are contextual), or even context- 
sensitive (deletions are not length-preserving). For our purposes, we consider them 
as string rewrite systems, more precisely semi-Thue systems. Writing T. g for E U {g}, 
the rules of P define a 1-step rewrite relation in the standard way: for u,u' G E*, we 
write u => rp u' whenever r is some rule a — > p\ u is some W1OCM2 with |wi(X| = p and 
u' = u\$U2- We often write shortly u => r u', or even u => u', when the position or the 
rule involved in the step can be left implicit. On the other hand, we sometimes use a 
subscript, e.g., writing u =>g v , when the underlying grammar has to be made explicit. 

A derivation is a sequence Jt of consecutive rewrite steps, i.e., is some uo =^ ri < Pl 
u\ ^ r 2'P2 U2 . . . ^ r n,p n Un ^ often abbreviated as uq =>" u n , or even uq u n . A subse- 
quence => r '>P> Mi)f= TO)m -|-i ; of % is a subderivation. As with all semi-Thue sys- 
tems, steps (and derivations) are closed under adjunction: if u u' then vuw => vu'w. 

Two derivations Jti = (u u') and %2 = (v =>* v') can be concatenated in the 
obvious way (denoted %\.%2) if u' = v. They are equivalent, denoted Jii = Jt2, if they 
have same extremities, i.e., if u = v and u' = v' . 

We say that u G E* is accepted by G if there is a derivation of the form ug =>* g and 
we write L(G) for the set of accepted words, i.e., the language recognized by G. 

We say that / C E* is an invariant for an LGr G — (E, P, g) if u G / and Mg =>• vg entail 
v G /. Knowing that / is an invariant for G is used in two symmetric ways: (1) from uel 
and ug =>* vg one deduces v G /, and (2) from Mg =>* vg and v g' / one deduces u £ I. 
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3.1 Graphs and Types for Leftist Grammars 

When dealing with LGr's, it is convenient to write insertion rules under the simpler 
form "a —7- b", and deletion rules as "d ---> c", emphasizing the fact that a (resp. d) is not 
modified during the insertion of b (resp. the deletion of c) on its left. For a 6 L g , we let 

def def 

ins(a) = {b \ P 3 (a — ¥ b)} and del(a) = {b \ P 3 (a ---> b)} denote the set of symbols 
that can be inserted (respectively, deleted) by a. We write ins + (a) for the smallest set 
that contains b and ins + (£>) for all b G ins(a), while del + (b) is defined similarly. We 
say that a is inactive in a LGr if del(a) Uins(a) = 0. 

It is often convenient to view LGr's in a graph-theoretical way. Formally, the graph 
of G = (L,P,g) is the directed graph 1q having the symbols from L„ as vertices and the 
rules fromP as edges (coming in two kinds, insertions and deletions). Furthermore, we 
often decorate such graphs with extra bookkeeping annotations. 

We say that G "has type x" when 1q is a sub-graph of 1. Thus a "type" is just a 
restriction on what are the allowed symbols and rules between them. Types are often 
given schematically, grouping symbols that play a similar role into a single vertex. For 

insertion: 

~~ IS g 

deletion: 

Fig. 1. Universal type (schematically). 



example, Fig. 1 displays schematically the type (parametrized by the alphabet) observed 
by all LGr's. 

3.2 Leftmost, Pure and Eager Derivations 

We speak informally of a " letter ", say a, when we really mean "an occurrence of the 
symbol a" (in some word). Furthermore, we follow letters along steps u => v, identifying 
the letters in u and the corresponding letters in v. Hence a "letter" is also a sequence of 
occurrences in consecutive words along a derivation. 

A letter a is a n-th descendant of another letter b (in the context of a derivation) if a 
has been inserted by b (when n = 1), or by a (n — l)-th descendant of b. 

Given a step u ^> r > p v, we say that the p-th letter in u, written u\p], is the active 
letter, the one that inserts, or deletes, a letter to its left. This is often emphasized by 
writing the step under the form (u —)u\au2 => u[au2(— v) (assuming u[p] = a). 

A letter is inert in a derivation if it is not active in any step of the derivation. A set 
of letters is inert if it only contains inert letters. A derivation is leftmost if every step 
u\aii2 => u\au2 in the derivation is such that u\ is inert in the rest of the derivation. 

A letter is useful in a derivation % = (u v) if it belongs to u or v, or if it inserts 
or deletes a useful letter along K. This recursive definition is well-founded: since let- 
ters only insert or delete to their left, the "inserts-or-deletes" relation between letters is 
acyclic. A derivation % is pure if all letters in % are useful. Observe that if % is not pure, 
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it necessarily inserts at some step some letter a (called a useless letter) that stays inert 
and will eventually be deleted. 

A derivation is eager if, informally, deletions occur as soon as possible. Formally, 
71 = (mo => r i'Pi u\ ■ ■ ■ ^"'P" u n ) is not eager if there is some w,_i of the form \v\baw2 
where b is inert in the rest of TC and is eventually deleted, where P contains the rule 
a ---> b, and where r, is not a deletion rule. 1 

A derivation is greedy if it is leftmost, pure and eager. Our definition general- 
izes [Jur07, Def. 4], most notably because it also applies to derivations ug =>* vg with 
nonempty v. Hence a subderivation %' of tc is leftmost, eager, pure, or greedy, when tc 
is. 

The following proposition generalizes [Jur07, Lemma 7]. 

Proposition 3.1 (Greedy derivations are sufficient). Every derivation tc has an equiv- 
alent greedy derivation %' . 

Proof. With a derivation tc of the form uq => n - Pl u\ => r 2'f2 m 2 • • • =^ r " ,p " u n , we associate 

def 

its measure //(tc) — (n,pi, ... ,p n ), a (n + l)-tuple of numbers. Measures are linearly 
ordered with the lexicographic ordering, giving rise to a quasi-ordering, denoted <„, 
between derivations. A derivation is called ^-minimal if any equivalent derivation has 
greater or equal measure. 

We can now prove Prop. 3.1 along the following lines (see Appendix A for full 
details): first prove that every derivation has a /j-minimal equivalent (Lemma A. 1), then 
show that /j-minimal derivations are greedy (Lemma A.2). □ 

Observe that < M is compatible with concatenation of derivations: if 7Ci <f, 7C2 then 
7C.7Ci.7c' <fj 7C.7C2.7c' when these concatenations are defined. Thus any subderivation of a 
/j-minimal derivation is ^-minimal, hence also greedy. 

jU-minimality is stronger than greediness, and is a powerful and convenient tool for 
proving Prop. 3.1. However, greediness is easier to reason with since it only involves 
local properties of derivations, while /j-minimality is "global". These intuitions are re- 
flected by, and explain, the following complexity results. 

Theorem 3.2. 1. Greediness (deciding whether a given derivation TC in the context of a 
given LGr G is greedy) is in L. 

2. jU-Minimality ( deciding whether it is p-minimal) is co N P '-complete, even if we restrict 
to acyclic LGr's. 

Proof. 1. Being leftmost or eager is easily checked in logspace (i.e., is in L). Checking 
non-purity can be done by looking for a last inserted useless letter, hence is in L too. 
2. /j-minimality is obviously in coNP. Hardness is proved as Coro. 6.9 below, as a 
byproduct of the reduction we use for the N P-hardness of Bounded Reachability. □ 



Eagerness does not require that r ( - deletes b: other deletions are allowed, only insertions are 
forbidden. 
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4 Leftist Grammars as Transformers 

Some leftist grammars are used as computing devices rather than recognizers of words. 
For this purpose, we require a strict separation between input and output symbols and 
speak of leftist transformers, or shortly LTr's. 

4.1 Leftist Transformers 

Formally, an LTr is a LGr G = (T.,P,g) where E is partitioned as A tfclB WC, and where 
symbols from A are inactive in P and are not inserted by P (see Fig. 2). This is denoted 
G : A h C. Here A contains the input symbols, B the temporary symbols, and C the 
output symbols, and G is more conveniently written as G = (A,B,C,P,g). When there 
is no need to distinguish between temporary and output symbols, we write G under the 

def 

form G = (A,D,P,g), where D = BUC contains the "working" symbols, 

a 

A - D i g 

4 ■ 

Fig. 2. Type of leftist transformers. 

A consequence of the restrictions imposed on LTr's is the following: 

Fact 4.1 A*D* is an invariant in any LTr G — {A,D,P,g). 

With G = (A,B,C,P,g), we associate a transformation (a relation between words) 
fi G CA*x C* defined by 

def 

u Rq v <^ ug =>5 vg A u G A* A v G C* 

and we say that G realizes Rq. Finally, a Ze/h'sf transformation is any relation on words 
realized by some LTr. By necessity, a leftist transformation can only relate words written 
using disjoint alphabets (this is not contradicted by e Rq £)• 

Leftist transformations respect some structural constraints. In this paper we shall 
use the following properties: 

Proposition 4.2 (Closure for leftist transformations, see App. B). If G : A h C is a 

leftist transformer, then Rq = . w .Rq. «). 

4.2 Composition 

We say that two leftist transformations R\ C A\ x Cj* and /?2 C A| x C| are chainable 
if Ci = A2 and Ai DC2 = 0. Two LTr's are chainable if they realize chainable transfor- 
mations. 

Theorem 4.3. The composition R1.R2 of two chainable leftist transformations is a left- 
ist transformation. Furthermore, one can build effectively a linear-sized LTr realizing 
R\ .R2 from LTr's realizing Ri and R2- 
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For a proof, assume G\ = (Ai,Bi,Ci,Pi,g) and G 2 — (A 2 ,B 2 ,C 2 ,P 2 ,g) realize Ri and 
R2- Beyond chainability, we assume that A 1 L)B\ and B2 UC 2 are disjoint, which can be 
ensured by renaming the intermediary symbols in B\ and fi 2 . The composed LTr G\ .G2 
is given by 

G1.G2 = (Ai,Bj UCi UB 2 ,C 2 ,Pi UP 2 ,g). 
This is indeed a LTr from A 1 to C 2 . See Fig. 3 for a schematics of its type. Since Gi.G 2 



g 




Fig. 3. The type of Gi.G 2 . 



has all rules from G\ and G 2 it is clear that (=>Gi + =^G9 ) C^g> from which we deduce 
/?G] ^G 2 f= Rgi.g 2 - Furthermore, the inclusion in the other direction also holds: 

Lemma 4.4 (Composition Lemma, see Appendix C). /?d g 2 = Rgi R G 2 - 

Remark 4.5 (Associativity). The composition (G\ .G 2 ) .G3 is well-defined if and only if 
Gi .(G2.G3) is. Furthermore, the two expressions denote exactly the same result. □ 

5 Simple Leftist Transformations 

As a tool for Sections 6 and 7, we now introduce and study restricted families of leftist 
grammars (and transformers) where deletion rules are forbidden (resp., only allowed on 
A). 

An insertion grammar is a LGr G = (E,P,g) where P only contain insertion rules. 
See Fig. 4 for a graphic definition. For an arbitrary leftist grammar G, we denote with 
G ms the insertion grammar obtained from G by keeping only the insertion rules. 

The insertion relation I G C E* x E* associated with an insertion grammar G = 

def - 

(L,P,g) is defined by u Iq v O ug =>g vg. Obviously, Iq C C e . Observe that /g is not 
necessarily a leftist transformation since it does not require any separation between in- 
put and output symbols. 

A simple leftist transformer is an LTr G = (A,B,C,P,g) where B = and where no 
rule in P erases symbols from C. See Fig. 4 for a graphic definition. We give, without 
proof, an immediate consequence of the definition: 

Lemma 5.1. Let G = (A,0,C,P,g) be a simple LTr and assume ug => G vg for some 
ueA* and v G C*. Then k=\u\ + |v|. 
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Fig. 4. Types of insertion grammars (left) and simple leftist transformers (right). 



Given a simple LTr G = (A,0,C,P,g) and two words u = a\ ■ ■ -a„ <E A* and v = 
c\ ■ ■ -c m G C*, we say that a non-decreasing map h : {1, . . . ,n} — > {l,...,m} is a G- 
witness for w and v if P contains the rules c h ^ ---> a, and Cj+\ — > cj (for all i = 1, . . . ,n 
and j — l,...,m, with the convention that c m+ \ = g). Finally, we write uVqv when 
such a G-witness exists. Clearly, Vq C R g . Indeed, when G is a simple transformer, Vg 
can be used as a restricted version of Rg that is easier to control and reason about. 

Lemma 5.2 (See App. D). Let G = (A,0,C,P,g) be a simple LTr. Then Rg = Ve./^™. 

Combining Lemma 5.2 with Ida* Q / G ins C C c , we obtain the following weaker but 
simpler statement. 

Corollary 5.3. Let G = (A,0,C,P,g) be a simple LTr. Then V C C« G C V G . C c . 
5.1 Union of Simple Leftist Transformers 

We now consider the combination of two simple LTr's G\ = (A,0,C\,P\,g) and G2 = 
(A,0,C2,P2,g) that transform from a same A to disjoint output alphabets, i.e., with 

def 

Ci nC 2 = 0. We define their union with Gi +G 2 = (A,0,Ci UC 2 ,Pi UP 2 ,g). This is 
clearly a simple LTr with (Rg { +Rg 2 ) ^= ^Gi+g 2 - 11 further satisfies: 

Lemma 5.4. Ifu /?Gi+g 2 v tnen u (^Gi +^g 2 ) v 'f or some v' C v. 

Proof. Assume u Rg 1 +g 2 v - With Cor. 5.3, we obtain u V Gi +g 2 v ' f° r some v' = c\ ■ ■ ■ c m C 
v. Hence G\ + G 2 has insertion rules c ;+ i — ?- c ; for all j = l,...,m, and deletion rules 
of the form c h ^ ---> u[i]. Since C\ and C 2 are disjoint, either all these rules are in G\ (and 
u Vg, v'), or they are all in G 2 (and u V Gl v'). Hence u {Rq 1 +Rg 2 ) v ' ■ ^ 

6 Encoding 3SAT with Acyclic Leftist Transformers 

This section proves the following result. 

Theorem 6.1. Bounded Reachability and Exact Bounded Reachability in leftist gram- 
mars are N F '-complete, even when restricting to acyclic grammars. 

(Exact) Bounded Reachability is the question whether there exists a n-step deriva- 
tion u => n v (respectively, a derivation u =>-" v of non-exact length at most n) between 
given u and v. These questions are among the simplest reachability questions and, since 
we consider that the input n is given in unary, 2 they are obviously in NP for leftist 
grammars (and all semi-Thue systems). 



It is natural to begin with this assumption when considering fundamental aspects of reachabil- 
ity since writing n more succinctly would blur the complexity-theoretical picture. 



Toward a Compositional Theory of Leftist Grammars and Transformations 



9 



Consequently, our contribution in this paper is the N P-hardness part. This is proved 
by encoding 3SAT instances in leftist grammars where reaching a given final v amounts 
to guessing a valuation that satisfies the formula. While the idea of the reduction is easy 
to grasp, the technicalities involved are heavy and it would be difficult to really prove 
the correctness of the reduction without relying on a compositional framework like the 
one we develop in this paper. It is indeed very tempting to "prove" it by just running an 
example. 

Rather than adopting this easy way, we shall describe the reduction as a composition 
of simple leftist transformers and use our composition theorems to break down the 
correctness proof in smaller, manageable parts. Once the ideas underlying the reduction 
are grasped, a good deal of the reasoning is of the type-checking kind: verifying that 
the conditions required for composing transformers are met. 

Throughout this section we assume a generic 3SAT instance 4> = Ki=\Ci wim m 
3-clauses on n Boolean variables in X = {x\,... ,x n }. Each clause has the form C, = 
\j\=\£i,kXi,k f° r some polarity e,^ 6 {+,— } and <E X. (There are two additional 
assumptions on <I> that we postpone until the proof of Coro. 6.5 for clarity.) We use 
standard model-theoretical notation like |= <t> (validity), or o (= <t> (entailment) when o 
is a Boolean formula or a Boolean valuation of some variables. 

We write a[x h-> b] for the extension of a valuation o with (x,b), assuming x 
Dom(o). Finally, for a valuation : X — > {T, _L} and some j = 0, . . . ,n, we write 8 7 to 
denote the restriction Q\{ X[t ... tXj } of on the first j variables. 

6.1 Associating an LTr G<t> with 4> 

For the encoding, we use an alphabet E = {T/ ,U- ,T'\,U' 3 i \ i = 1,.. . ,m A j = 0, ...,«}, 
i.e., 4(n + 1) symbols for each clause. The choice of the symbols is that a U means 
"Undetermined" and a T means "True", or determined to be valid. 

For j = 0, . . . ,n, let Vj d ^ {£//,..., U,i V} = {U'{,..., U" m ,T'{,..., I"'}, 

def 

and Wj = Vj U Vj, so that E is partitioned in levels with E = U"=o Wj. With each xj 6 X 
we associate two intermediary LTr's: 

Gj d ^ f <yrj- U 0,Vj,Pj, g ), Gj d ^ f (Wj- h 0,v;,p'j, g ) 

with sets of rules Pj and P'-. The rules for Gj are given in Fig. 5: some deletion rules are 
conditional, depending on whether Xj appears in the clauses Ci,. . . ,C m . The rules for 
Gj- are obtained by switching primed and unprimed symbols, and by having conditional 
rules based on whether -acj appears in the C,'s. One easily checks that Gj and Gj 
are indeed simple transformers. They have same inputs and disjoint outputs so that 
the union (Gj + Gj-) : Wj-\ h Wj is well-defined. Hence the following composition is 
well-formed: 

^ (Gj +Gi). (Gj + Gi)- (Gj +G£). 

We conclude the definition of G<t> with an intuitive explanation of the idea behind the re- 
duction. Go operates on the word uq = U® ■■■U® where each Uf stands for "the validity 
of clause Ci is undetermined at step 0(i.e., at the beginning)". At step j, Gj + Gj picks 
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Fig. 5. Py, the rules for Gj: Fixed part on left, conditional part on right. 



a valuation forx,-: Gj picks "Xj = T" while Gj" picks "jty = _L". This transforms u( 1 
into U-, and Tj 7 1 into T/, moving them to the next level. Furthermore, an undetermined 
U(~ X can be transformed into T/ if C, is satisfied by Xj. In addition, and because Gj 
and G| must have disjoint output alphabets, the symbols in the V/s come in two copies 
(hence the Vj's) that behave identically when they are input in the transformer for the 
next step. 

The reduction is concluded with the following claim that we prove by combining 
Corollaries 6.5 and 6.8 below. 

* is satisfiable iff U°U$ ■ ■ ■ U°g =s^ T?Tf -T^g 

iff U?U° ■ ■ ■ U°g ^f™ T?TZ ■ ■ ■ Tig (Correctness) 

■^Ulul-Ulg^T^-Tlg. 

Observe finally that G$ is an acyclic grammar in the sense of [JL07], that is to say, 
its rules define an acyclic "may-act-upon" relation between symbols. Such grammars 
are much weaker than general LGr's since, e.g., languages recognized by LGr's with 
acyclic deletion rules (and arbitrary insertion rules) are regular [JL07]. 

Remark 6.2. The construction of G<t> from <t>, mostly amounting to copying operations 
for the Gj's and Gj-'s, to type-checking and sets-joining operations for the composition 
of the LTr's, can be carried out in logarithmic space. □ 

6.2 Correctness of the Reduction 

We say that a word u is j -clean if it has exactly m symbols and if u[i] € {T- ,T'j ,U- ,U'j} 
for all i = 1 , . . . , m. It is T -homogeneous (resp. ^-homogeneous) if it does not contain 
any (resp., only contains) primed symbols. 

Let < j < n and 8 7 be a Boolean valuation of x\ , . . . ,xy. we say that a y-clean u 
respects (4> under) 9^ when, for all ;' = 1, . . . ,m, 6, |= C, when u[i] is determined (i.e., 
€ T/ + T'j). Finally u codes (<t> under) 8 7 if additionally each u[i] is determined when 
Qj \= d. Thus, a word u that codes some 8 ; exactly lists (via determined symbols) the 
clauses of 4> made valid by Qj, and the only flexibility in u is in using the primed or the 
unprimed copy of the symbols. Hence there is only one j-clean u coding 8 7 that is T- 
homogeneous, and only one that is _L-homogeneous. If u respects Qj instead of coding 
it, more latitude exists since symbols may be undetermined even if the corresponding 
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clause is valid under Bj. 

Assume that, for some j G {1, . . . ,«}, Uj-\ codes 8 ; _i and Uj codes Qj. Write b for 
6(jc,)(NB:ie{T,_L}). 

Lemma 6.3. If Uj is b-homogeneous then uj-\ V G b uj. 

Proof. Let h = M{ !,...,»,}. We claim that h is a G^-witness for Uj-\ and k,-, i.e., that G* 
contains the required insertion and deletion rules. 

Insertions. G b - has all insertion rules g — > Uj[m] — > Uj[m — 1] — > . . . — > uj[l] (leftmost 
rules in Fig. 5) since Uj is ^-homogeneous. 

Deletions. G b j has all deletion rules Uj[i] ---> Uj-\ [/]. Firstly, both undetermined symbols 

U'j and U"j may delete their counterparts t/j_j and U"j_i, and similarly for the deter- 
mined symbols (the unconditional deletion rules in Fig. 5). This is used if C, is not more 
valid under Qj than under 9/_i. Secondly, if C, is valid under 8 7 but not under Qj-i, then 
Xj \= Ci (or -ixj \= C, depending on b) and the conditional rules in Fig. 5 allow a deter- 
mined T/ (or T'j depending on b) to delete U- or U'j . □ 

Lemma 6.4. If Uj is b-homogeneous, then u^\g ujg. 

Proof. From V G t uj (Lemma 6.3) we deduce Uj-\ R G b uj, i.e., Uj-\g =>* G b u jg> by 
Lemma 5.2, and then Uj-\g =>^" ujg by Lemma 5.1. □ 

Corollary 6.5. If<P is satisfiable, then Uf ■ ■ ■ U? n g T? ■ ■ ■ T£g. 

Proof. Since <I> is satisfiable, 8 |= <t> for some valuation 8. For j — 1 , . . . , m, we write bj 
for Q(xj) and let Uj be the only y'-clean -homogeneous word that codes for 8 7 . 

We now make two assumptions on <t> that are no loss of generality. First we require 
that no clause C, contains both a literal and its negation, hence no C, is tautologically 
valid. Then mo = f U® ■ ■ ■ U® codes the empty valuation 8o- Second, we require that 4> 
is only satisfiable with b n = T (which can be easily ensured by adding a few extra 
variables). Then necessarily u„ = T" ■■■ T£. 




for all b and j, we deduce uog =>g™" u n g as claimed. □ 

Fix some 8, some j G {1, . . . ,n} and let b = 0(xj). 
Lemma 6.6. Ifu respects 8j_i and u V G t v, then v respects Qj. 

Proof. Write / for |v|. From u V G b v (witnessed by some h) we deduce that G h j has 

insertion rules g —> v[l) — > v[l — 1] — > . . . — > v[l]. Inspecting Fig. 5, we conclude that 
necessarily I < m. Since deletion rules v[h(i)} ---> u[i] are required for all i = 1,. . . ,m, 

we further see from Fig. 5 that h is injective, so that / > m. Finally I — m,h — Id{\ m j, 

v is j-clean and ^-homogeneous. 
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Now, knowing that G h j contains the rules v[i] ---> u[i], we show that v respects 9 ; . 
Suppose, by way of contradiction, that it does not. Thus there is some i £ { 1 , . . . , m} 
with v[i] = T/ (assuming b = T w.l.o.g.) while 9 ; ^ Q (so that 8/_i ^ Q). From 9/ ^ Q 
we deduce that Xj y= Ci. Hence G b j does not have the conditional rules T/ ---> Uf 1 and 

fj U'j~ l . Thus u[i] {U-~ l ,U'j~ 1 }. But then u does not respect 9 y _i, contradicting 
our assumption. □ 

We immediately deduce: 

Lemma 6.7. If x R G b y and there is some u\—x that respects 9y_i, then there is some 
vCj that respects 9/. 

Proof. From the Closure Property 4.2, we get u R G b y. Then, from R G b C V G b. C 
(Coro. 5.3) we deduce u V G b v for some vCy. Now v respects 9, thanks to Lemma 6.6. 

' a 

Corollary 6.8. 7/f/f ■ • • t/°g =^ • • • 7^g, f/zen 4> is satisfiable. 

Proof. Write mo for t/j° • • • and «„ for T" • • • T^. From the definition of G<t> and the 
Composition Lemma 4.4, we deduce that there exist some words u\ , . . . , w„_i such that 
Uj-\ R g t +g ±- Uj for all j = 1, . . . ,«. 

With Lemma 5.4, we further deduce that there exist some words and 
Boolean values b\,...,b n such that m' C u , and u;-\R * . t/ for all /' = 1 , . . . ,n. Hence 

also m' 7 _[ /?^ by Prop. 4.2 (and letting m = mo). 

Write 9 for [jci >->• b\, . . . ,x n n> b n }. With Lemma 6.7, induction on j, and since u' 
respects 9o, we further deduce that there exists some words such that, for all 

j = l,...,n, u'j C and u'j respects Qj. From \u%\ — m (it respects 9) and u" n C m„, we 
deduce that u" n = u„. Finally, 9 |= 4> since u" n respects 9 and u" n = u„ = T" ■ ■ -T£. □ 

Corollary 6.9. /it-Minimality of a derivation is coUP-hard. 

Proof (Sketch). We define G^ by taking Go, adding k extra symbols ai,...,^, and 
adding the following two sets of rules: 

(1) all — > a,- and a,_i ---> a,- for i = 1, . . . , & (with the convention that ao is r™); 

(2) all ak —> Uf for i = 1 , . . . , m. 

Observe that G^ is acyclic. It has a derivation jc : t/f • • ■ U%g ^ 2m + 2k T? ■ ■ ■ T£g of 
the following form: 

t/° • • ■ U°g ^ m t/° • • ■ t/°77 • • • % ^ t/° ■ ■ ■ I/Ofl^., ■ ■ -a^" • • • r»* 

This derivation uses the extra symbols to bypass the normal behaviour of G<t>. If k is 
large enough, i.e., k > m(n — 1), Jt is /j-minimal if, and only if, 4> is not satisfiable. □ 
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7 Anchored Leftist Transformers and Their Transitive Closure 

When bi,b2 G B are two different working symbols, and (A,B,C,P,g) is a LTr, we 
call G = (A,B,C,bi,b2,P,g) an anchored LTr, or shorly an ALTr. With an ALTr G we 
associate an anchored tranformation SqCA* x C* defined by 



Here the anchors b\,bj are used to control what happens at the left-hand end of trans- 
formed words. Mostly, they ensure that the derivation b\ug=>* bjyg goes all the way to 
the left and erases b\ rather than stopping earlier. One intuitive way of seeing Sg is that 
it is a variant of Rq restricted to derivations that replace the anchors. 

A first difficulty for building the transitive closure of an anchored transformation 
So C A* x C* is that the input and output sets are disjoint (a requirement that allowed 
the developments of Sections 4 and 5). To circumvent this, we assume w.l.o.g. that 
A and C are two different copies of a same set, equipped with a bijective renaming 
h : C* —5- A*. Then, the closure Sc-ih-Sc)* behaves like we would want S G to behave. 

For the rest of this section, we assume h is a bijection between C and A. W.l.o.g., we 
write A and C under the forms A = {a\, . . . ,a n } and C = {c\, . . . ,c n } so that h(ci) = a,- 
for all ; = l,...,n. Then h is lifted as a (bijective) morphism h : C* — > A* that we 
sometimes see as a relation between words. 

The exact statement we prove in this section is the following: 

Theorem 7.1 (Transitive Closure). Let G: Ah- C be an ALTr such that Sg = Sg- Ec- 
Then there exists an ALTr G^ : A h C such that S G (+) — SG-(h-Sc)*- 
Furthermore, it is possible to build G^ + ' from G using only logarithmic space. 



is called a renamer (of C to A), and often shortly written R. Observe that R : C h A is 
indeed an ALTr. It further satisfies Sr = w . C .h. 

We shall now glue an ALTr G : A h C with the renamer R : C h A into some larger 
LGr H. But before this can be done we need to put some wrapping control on G (and 
on R) that will let us track what comes from G inside H's derivations. 

Formally, given an ALTr G = (A,B 7 C,bi,b2,P,g) and two new anchor symbols 

def 

□ l, O2 we letE D = {□!, EI2} and define anew ALTr Fg,oi,D2 ( or shortly just Fg) 
for "wrapping G with □i,D2", and given by n 2 "= (A,B,C, Di, U2,P',g) where 

- A = A U A' U {bi ,b[ }, A' , b\ being a copy of A , b 1 , 

-8={ni,n 2 }uB\H 

- C = C U {b 2 } U C US' \ {b\ } , B' and C being copies of B and C. 

Finally, let D = C U B and D' = f C' U B' . (The copies are denoted by priming the original 
symbols, and a primed set like A 1 — {a' \ a e A) is just the set of corresponding primed 
symbols.) The rules in P' are derived from the rules of P in the following way. (See 
Fig. 6 in App. E for a schematic type.) 



uS G v <£> biug^* G b 2 vg. 



Let b u b 2 ^AUC. The ALTr R blM = (C 7 b 2 ,buA,P R ,g) with 
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kept: P' retains all rules of P that do not erase a letter in A U {b\ }, 

replace: P' has a rule d' —>a for each rule d ---> a in P that erases a letter in A U {b\ }, 

mirror: P' has a rule d^d' for each d £ D, 

clean: P' has all rules d' — > e' and D 2 — > a' for d',e' 6D'\ {fe'J and a'eA'U {fe'J, 
fr-rules: P' has the rules n 2 — > Di and all rules d' — > D 2 for d' e D' \ {b[}. 

We now relate the derivations in G and the derivations in Fq. For this, assume w £ 
(A + b x )* and v£ (C + £ 2 )+. 

Lemma 7.2 (See App. E.l). 1. Ifu.g ==><t v.g then for all words a G (A' + f/zere 
era/s a symbol P £ C'U{/> 2 } smc/j f/iaf □ l.OC.w.g =>^ G □ l.OC.p.v.g D 2 .p.v.g. 
2. Reciprocally, for all a e (A' + fo'J^/or a//p e (C" + 2> 2 )+ (/"□i.a.H.g =^ q D 2 .p.v.g 
f/ien w.g =>J v.g. 

Thus we can relate anchored derivations in Fq with anchored derivations in G via: 

Corollary 7.3. Let u £ (A+b\)* and v £ (C + /> 2 ) + . Then by.u.g =>J bi-v.g if and only 
if there exists p £ (C' U {b 2 }) shc/i f/W □ i.OC.&i.K.g =>J G D 2 .p.^ 2 .v.g. 7n of/zer words, 
uScviffa.bi.uSFc $.b 2 .v for some P £ (C'U {#,}). 

We may now glue the wrapped versions of G and its associated R. Recall that Fq = 

(A, 5, C, □ i , D 2 , P', g) . We denote the set of new symbols with E = f A U B U C and ob- 
serve that P R (short for Pr^ h , D2lDl ), being some (CuC U {fc 2 , fo 2 } , E n , A, D 2 , □ i , P R , g), 

does not use more symbols. Let H = f (E, P# , g) be the LGr such that and P// = P' U P R . 
Essentially, // is a union of the two wrapping ALTr's. (See Fig. 7 in App. E.2 for a 
schematic description). Note that H is not a LTr since it does not respect any distinc- 
tion between input, intermediary, and output symbols. 

Lemma 7.4 (See App. E.3). Let a, p £ A' + and u,veA*. Ifn i .a.u.g =>* H □ i .p. v.g and 
S G = (Ea -So- Ec) rten m c a .(5 g .A)* v. 

We now extend H to turn it into an ALTr H' : A h A U A', introducing again new 
copies, denoted a, . . . , of previously used symbols and writing u = d\di ...d n for the 
dotted copy of some u~a\ai...a n . Formally, 

//' = (A,PUP'ucuc'u{ni,n 2 ,[ji,[j 2 },AuA , ,[j 1 ,[j 2 ,p , ',g) 

where P" extends Ph by the rules [j 2 ---> di, Di — > [j 2 , and all a ---> a for a £ A. 
The anchored transformation S^' computed by H 1 is captured by the following: 

Lemma 7.5 (See App. E.4). Let u,v £ A*. Then u S H i Di.p.v for some p £ A' + (jj 
u [h. Q A .(S G .h)*} v. 

We are nearly done. There only remains to compose /P with a LTr that checks for 
the presence of Di.p (and then erases it). For this last step, we shall use further dotted 
copies E, E, . . . , of the previously used symbols. 

Formally, we define two new ALTr's T\ and T 2 : see App. E.5. The rules of T\ ensure 
that it satisfies 



uSti v iff u = □ i .<X.b\ .u' and ii I T i ns v. 



(Pi-spec) 
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Regarding T2, let u 6 (A UA' U {b\,b[})* and v G A*. If «' is the largest subword of u 
such that t/ e A*, then 

m 5r 2 v iff u Q-£ v. (r2-spec) 
Combining (Fi-spec) and (r2-spec) we obtain 

u St { -St 2 v iff u = □ 1 .OC.&i .w' and t/ C-j v. 

Composing these LTr's as H' .T\ .T2 yields a resulting : A h A, which, up to a bi- 
jective change of symbols, is what we need to build to prove Theorem 7.1. 

8 Conclusion 

In this paper we introduce a notion of transformations computed by leftist grammars and 
define constructions showing how these transformations are effectively closed under 
sequential composition and transitive closure. 

These operations require that some "typing" assumptions are satisfied (e.g., we only 
know how to build a transitive closure on leftist transformers that are "anchored"*) which 
may be seen as a lack of elegance and generality of the theory, but which we see as an 
indication that leftist grammars are very hard to control and reason about. 

Anyway, the restrictive assumptions are not a problem for our purposes: we intend 
to rely on the compositional foundations for building, in a modular way, complex leftist 
grammars that are able to simulate lossy channel systems. Here the modularity is essen- 
tial not so much for building complex grammars. Rather, it is essential for proving their 
correctness by a divide-and-conquer approach, in the way we proved the correctness of 
our encoding of 3SAT instances in Section 6. 

As another direction for future work, we would like to mention that the proof that 
accessibility is decidable for LGr's (see [MPSVOO]) has to be fixed and completed. 

Acknowledgements. Sylvain Schmitz helped tremendously with his numerous remarks 
and suggestions. 
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A Greedy derivations are sufficient 

The lexicographic ordering between derivations is denoted <i ex - 
Lemma A.l. Every derivation has a /j-minimal equivalent. 

Proof. Direct from observing that < M is a well-founded quasi-ordering over derivations. 
Indeed, while <i ex is not well-founded over the set of tuples of natural numbers, it is 
well-founded over the set UhgnI"} x to which measures of derivations belong. □ 

The proof of Proposition 3.1 is concluded with the following: 

Lemma A.2. A p-minimal derivation is greedy. 

Proof. By combining lemmas A.4, A.5 and A. 6 below. □ 

Lemma A.3. Assume % = uq =^> r i^i u\ =$* r i>Pi m 2 is a two-step derivation. If pi < pi — 
1, or if p2 = pi — 1 and r\ is an insertion rule, then % is not p-minimal. 

Proof. The hypothesis ensures that the two steps do not interfere. Thus they can be 
swapped, yielding an equivalent derivation %' = uq =>'-=>' 1 m. Clearly, %' < M %. □ 

In the rest of this section, we consider a generic transformation jc of the form 

u => r i'-Pi u\ =^ r 2,P2 Ul . . . ^r„, P „ Un in context f some LGr G = (Z,P,g). 

Lemma A.4. A p-minimal derivation is leftmost. 

Proof. Assume jc is not leftmost. Then it contains a step = \v\aw2 => w[aw>2 = U[ 
where w\ is not inert in the rest of Jt. Let j > i be the first step after i where a letter of w\ 
is active: the subderivation w ; _2 =>-0'-i>Py-i Uj-\ => r i< p i has pj < Pj-i, and even pj < 
Pj-i — 1 if rj-\ is a deletion rule. Lemma A. 3 applies and entails that this subderivation, 
hence also Jt, is not /j-minimal. □ 

Lemma A.5. A p-minimal derivation is eager. 

Proof. Assume jc is not eager. Let =$> r '>Pt m be the first step that violates eagerness: 
then Ui-\ is some w\baw2, w\b will remain inert in the rest of jc, and b will eventually 
be deleted at some step j > i, but this is not done right now even though P contains 

a —>b. 

We now consider several cases for step i. If the active letter occurs to the right of a, 
then one obtains a new derivation jc' by deleting b (using a ---> b) right now, continuing 
like jc, and skipping step j since b has already been deleted. This produces an equivalent 
derivation, with p(%') — (n,pi,. . . ,/>,_!,/, . . .} where / = \w\b\ < pi. Hence jc is not p.- 
minimal. If a is the active letter, the step must be an insertion a — > c and pi + \ > p,-: 
we obtain, as in the previous case, an equivalent tc' with /j(tc') = (n,pi,...,pi,pt — 
1, • • •) <iex ("(ft)- Finally, the active letter cannot be to the left of a since w\b remains 
inert. □ 

Lemma A.6. A p-minimal derivation is pure. 

Proof. Assume tc is not pure. Then it inserts at some step a useless letter a that stays 
inert and is eventually deleted. By not inserting a and not deleting it later, one obtains 
an equivalent but shorter derivation. □ 
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B Proof of the Closure Property (Prop. 4.2) 

Let G = (Z,P,g) be some arbitrary LGr. The following two observations are easy. 

Fact B.l Assume uau'g ^>* G vg is a derivation where the letter a is inert and eventually 
erased. Then ua n u'g => G vg for all n G N. 

Fact B.2 Assume a does not occur in u. Then ug => G vav'g iffug => G vaav'g. 

Now, in the case where G is an LTr (A,D,P,g), we deduce ZU . w .R G C R g from 
Fact B.l, and R G . « C fl G from Fact B.2. This entails R G = =U • ~ Rg- □ 

C Proof of the Composition Lemma (Lemma 4.4) 

There only is to prove that R Gl , Gl Q R Gl .R Gl . For this we consider a greedy derivation 
71 = (ug => Gi Gz vg) with u G A j and v G C|, and consider two cases: 

1 . If JC never uses a rule from G2, then no symbols from C2 are inserted and necessarily 
v = e. We obtain uR Gl .R Gl v by observing that uR Gl e (as witnessed by 71) and that £/?c 2 £ 
(true of all leftist transformations). 

2. Otherwise, we isolate the first G2 step and write 7C under the form 

71] 7C 2 

' " ' 1 " ' 

Ug =>Gi w ^ ^G 2 W ^ =>Gi.G2 

Necessarily, w G A\D\ (Fact 4.1) and, since symbols from A 1 \JD\ are inactive in G2, 
the first G2 step is an insertion by g, i.e., some wg => weg = w'g with e G D2 — B2 U C2. 
Since 7C is greedy, w is inert in 7^2- 

Now, every word along 7C2 is some xyg with x an inert prefix of w and y G Z)^ = 
(Z?2 U C2)*. This claims holds at the first step (since e G D2) and is proved by induction 
for the next steps. Assume that the fc-th step is some xyg zg: since x is inert, the active 
letter is in y, hence in D2 (by ind. hyp.) and the step is a G2 step. If the step is a deletion 
step, of the last letter in x or of some letter in y, zg satisfies the claim. If the step is an 
insertion step, the claim is satisfied again since G2 can only insert letters from D2 and 
to the right of x. 

Finally, 7C must have the form ug =^ Gl wg ==> Gl vg. Since symbols from A 1 UB\ can- 
not be erased by G2 rules, then necessarily w G C\. Hence uR Gl w and wR Gl v, proving 
uR Gl .R Gl v. □ 

D Proof of Lemma 5.2 

The inclusion Vo./gms C R G is clear in view of V G C R G and since u Iqibs v implies 
u, v G C* and ug => G vg. 

= R G i« s Cfl G n (C* xC*) and 
For the other inclusion, R G C Ve./gins, we consider a greedy derivation 



Ug = W g Wig =^G 



■ =>G Wig = Vg. 
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Every w, is some m,v; with m,- e A* and v ( - 6 C* (Fact 4.1) and, since A is inert in G, m,- 
is a prefix of u (so that we can write u under the form m,.m|). Let A: be the first index s.t. 
Uk = e, so that w, e C* for all z = k, . . . , I and wt /gins v. There remains to show u V G Wk, 
i.e., m[ V g v k . 

For this we show more generally that u\ V G v, for all i = 0, . . . ,k. We proceed by 
induction on ;. The base case clearly holds since u' = vq — e. For the induction step, we 
assume «• V G v,- and a witness ft, : {1, . . . , |m-|} — >• {1,. . . , |v,|} for some i < k. Consider 
the step U(Vig => r ui + \Vi + \g. There are two cases. 

(1) If r = c — > c' is an insertion rule, then the insertion must take place in front of v, 
otherwise the derivation is not leftmost (the first letter in v, cannot be inactive since 
it cannot be deleted and m,- ^ e must be deleted). Hence v,-+i = c'v,, u' i+l = and a 

def 

witness ft' for u' i+l V G v,-+i is obtained from h with = h{i) + 1. 

(2) If r = c--->a is a deletion rule, then m; = Mj+ia, i.e., mJ +1 = at<;, and V g v,+ i (= v,-) 

is witnessed by h! defined as h'(l) = 1 and /;'(/+ 1) = h(i). □ 



E Proofs for Section 7 
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Fig. 6. A schematic type forF Gai . a2 . 
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E.l Proof of Lemma 7.2 

1. We first prove, by induction on the length of a derivation ug v\v 2 g, with v\ G 
(A + bi)* and v 2 G (Z) + fe 2 )*> that it can be mimicked as □ 1 .a.« =>^ G □] .a.vi .J.v 2 .g 
for some y G D' \ {£>[}. First, and since fc contains rules "kept" from G, it can mimic 
any G-step that does not delete a letter from A + b\. For steps v\.a.d.v 2 =>g v\.d.v 2 
using <f ---> a with a G A + b\ and c/ G D then, for all y G (D 1 \ {//J)*, the following 
derivation exists: 



C' + b' 2 . This means erasing a, and replacing y (that may belong to D' \ C'), using a 
primed version of the first symbol of v. Formally, we use 



2. If F G uses a kept rule, then the same rule exists in G and is also usable. If a 
mirror, clean, or fo-rule is used then G can mimic by doing nothing. If a replace rule 
occurs in a step of the form U\.(X.v\.a.d' .v 2 =>™ place U\.<X.v\.d' .v 2 ,a EA,d' G D' ' ,v\ G 
(A + b\ )* , V2 G (D \ }) + then in a previous step there was the letter d at the head of 
the (D \ {^i}) + part of the word (in order to insert d') and a was not deleted. At that 
time G could delete a to finally reach vi . v 2 - □ 

E.2 Proofs for the Correctness of H 

Let a, a' G (A' + b\) + and M ,vG {A + b x )*. 

Lemma E.l. If the derivation r = Ui.a.u.g rh.a'.v.g is greedy, then there are no 
insertions of a U\ to the right of another Hi or of a letter G (A' Ub[), and there also 
are no insertions of a U 2 to the right of another U 2 or a of letter G (D' \ {bi}). 

Proof (Idea). If there is that kind of insertion, then the □ is kept until the end of the 
derivation or deleted. Since that letter can't insert or delete in that position, then if it 
is kept, there is always some letter on its left. There is only one □ at the end of the 
derivation, so the letter is deleted. That □ has no descendant, is not present at the end 
and did not delete anything, so it is useless, which contradicts the greediness hypothesis. 

Lemma E.2. The following three languages are invariants ofH: 





□2-p.v.g. 



L*.(A+D).Z .Z*, 

r.(A+bi).(z D +A'+b\).r, 

L*.(D\b l ).(L a +D'\b' l ).L*. 



(h) 
(h) 
(h) 
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kept ;.Ci A U {bi } 




replace 



clean CT.^'U^U 



fe-rules 



D !D) kept 



filean 



»' ',/<; i dean 

,1 



fe-rules 



□ l i-rules D2 



Fig. 7. Type of # G . 



Proo/ We only prove the first invariant, the other two rely on similar arguments. Let 
w = u.l. □ .v with I G A UD and □ e E Q . The invariance of I\ could be violated by 
inserting a symbol between / and □, or deleting I or □. □ cannot delete I or insert 
letters, and □ can only be deleted by a letter from E Q , a situation where the invariant is 
preserved. □ 

We say that a word a.w blocks a language L C E* if for all v G L, for all u G £*, for all 
derivations Jt = (u.a.w.v.g xg) where ww is inert, t/.a is not deleted. The definition 
means that no vg with v G L can erase anything left of w. Obviously, since we only 
consider derivations with uw inert, any w' with a.w C w/ blocks L when a.w does. 

The arguments used to prove Lemma E.2 can be reused to show the following: 

Lemma E.3. For all a G (AU{b\}) and all d G (£> \ {fci}), 

a Wocfcs (A' + b\ + Di + D2)-£*, 
c/ Moc/b (£>' \ {fe'J + Di + D2).£*, 
a.J Wocfa (AUA'u{fci,fci,Di}).Z*, 
t/.fl Wocfa (D U £>' \ {bi , ^ } U { n 2 }) ■!■* , 
a.c/' Moc/b (A'U{^}).Z*, 
t/.fl' Moc/b (D'\{b[}).T,*, 
a.d'.d.a blocks L*, 
d.a'.a.d blocks E*. 



vi 



Technical appendix, not for the proceedings version. 



We now have the necessary tools to prove that a greedy derivation by H alternates 
between two specific modes where there is no confusion between G steps and steps by 
the renamer: We define Lac,Lca C L* as 

Lac = + Dj 2 .(A + bi)" 3 .(D' \ b[)" 4 .(D \ bi)" 5 .g | s.t. c}, 

Lca = {Ui.{D' \b\r .(D^b^.iA 1 +b' i r -(A+bi)" 5 -g | s.t. c}, 

where numbers n\ ,ri2, ■ ■ ■ ,«5 eN can take any values that respect the C constraint: 

»2 < 1, 

A if m = then «2 > 0, 

A if «2 > then nj, = 0, (C) 
A if «3 = then «4 > 0, 
A if «4 > then «s > 0. 

Let a,p e A' + , and u,v eA*. 

Lemma E.4. If there is a derivation H\.a.u.g U\.$.v.g, then there is an equivalent 
greedy one such that each step is in Lac U Lca- 

Proof. Consider the greedy derivation with the fewest number of steps out of Lac ULca- 
We assume that this number is > and obtain a contradiction, thus proving the 
Lemma. For this we consider the first step that goes out of Lac ULca- Assume that this 
step is wg =>h ■■■ for w.g 6 Lac (hence w can be written under the form Hi-X. D^ 2 
.Y.Z.T.g) and proceed by a case analysis of which letter is active in this step: 

- Di has no rule that can be applied here. 

- some letter x G A' U b[ in X: 

• x —> a', a' e A' U b\ stays in Lac, 

• x ---> c, c G C is not usable, 

• x — > Di forbidden by Lemma E. 1 (whether the insertion is at the head of X or 
inside it), 

- nr. 

• D2 ---> □] leads to Lca (since necessarily in this case n\ = 0, «2 = L »3 = 0, 
«4 > 0, and «5 > 0), 

• U2 — > a', a' G A' U fcj stays in L A c, 

- some letter xeAUfci in F: 

• x ---> a or x — t- a, a G A U b\ stays in L^c, 

• x — > a', a' G A' U foj stays in L^c if inserted at the head of Y, is forbidden by 
Lemma E. 1 if inside of Y, 

- some letter i£fl' \ b\ in Z: 

• x ---> d', d' G D' \ foj stays in Lac, 

• x ---> a, a G A Ubi stays in L^c, 

• x — >• D2 if «3 > or inserts inside Z, forbidden by Lemma E.2, else stays in 
Lac if «2 = 0, else forbidden by Lemma E.l, 

- some letter x G D \ b\ in L: 

• x ---> <f, or x —> d, d G D \ b\ stays in Lac, 
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• x — > d', d! G D' \ b\ stays in L AC if inserts at head of T, else forbidden by 
Lemma E.2, 

- 8- 

• g —> d, d £D-^bi stays in Lac, 

• g — > a, a G A Ub\: 

* if «5 = then «4 = and «3 > so the step stays in Lac, 

* if «5 > 0: 

• if «4 = then nj > 0,«2 = and «i > 0: 

Then w = U\.X.Y.T with \T\ > 0. We write T = yv with y £ D \ b\ 

and let w' = w.a. After the insertion of a, w is inert (by leftmost). Since 

y does not appear at the end of the derivation, it is deleted. 

If y is deleted by a descendant of a letter of A Ubi, then it is by some 

a' G A' + b[. The word at the step just after the deletion would start 

with Di .u.u.a' which is forbidden by Lemma E.2. 

The only other letters able to delete y are letters from D\bi. When g 

inserts the first d G D\b\ after a, the letter at its left is a a. So there is 

d.a as subword of the inert letters. Since d.a blocks {D \ bi).Y.*, the d 

won't be deleted. 

• if n 4 > 

1. if «3 > then «2 = and w is some H\.X .Y.Z.T with non-empty X, 
Y and Z. We write Z = yZ' with y G D' \ b\ and w' = wa. 
oc.M.y'.v'.v, oc G (A' + fo' 1 )+,M G (A + fei)+,y eD'x&l/ G (£>' \ 
/?i)*,v G (£)\fei)* and w' = w.a. 

After the insertion of a, w is inert (by leftmost). Since y does not 

appear at the end of the derivation, it is deleted. 

It cannot be deleted by descendants of A Ub\, which could be □[ 

because (A+b\).ni is a forbidden invariant (Lemma E.2). 

□ i.OC.m cannot be deleted since u.y'.v.a blocks E* (Lemma E.3). 

Hence letters from D' \ b\ are eventually deleted by some □ i which 

is not the one on the left and all the accessible words are in □i.a.w.E*- 

.(□i + Ch) •£*■£, to which Di.p.v.g does not belong. 

2. if « 3 =0 

(a) if «2 = then n\ > and w is some U\X.Z.T with non-empty X 
and Z. We write Z = yZ' with ygD'\ j and w' = wo. 
The Di.a prefix is eventually deleted: the letters from £)' \ b[ 
cannot be deleted without introducing a to the right of the 
first Di. Since in the last word, there is only one Hi, at least one 
of the two is deleted. Since L*.(Di + Oi)-^* is an invariant, if 
the leftmost □[ is not deleted, the word will stay in □!.£*.(□ i + 
□ 2 ).£*. 

So there is a subderivation of the form 



□ i X .y.Z" .g=> ro ni.X .y.Z' . T.a .g => ri Ui .x.g 
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such that Hi.x'.g is the step where the n 2 from U 2 .x.g is deleted 
and □] X.y.Z" .g is the last step where y is active or when it was 
inserted. 

We can transform the sub-derivation such that Hi-X .y.Z" .g =>■ 

ni.x.n 2 -y.z".g^ x \ +l ui.y.z" .g=> r « n 2 .y.z.T.a.g^ n 2 .n 2 

.x.g ^ 2 m. U\ .x'.g Ui.x'.g ^ \3i.fi.T.g, where r\ is r\ 
without the steps deleting letters from H\.X. 
That derivation is still greedy, and has strictly fewer steps out of 
Lac U Lca than the original. 

(b) if«2>0and«i > then the rule U 2 —>a, a' eA'Ub[ has priority 
over the insertion (by eagerness), 

(c) if «2 > and n\ = then the rule n 2 —> Di has priority over the 
insertion (by eagerness). 

The case where w.g G Lca is symmetrical. The only differences are the reasons 
why some letters must be deleted. In the Lac case, when a letter a eAUA'U {bi,b[} 
is inert and has some letter from D'UD at its left, it must be deleted to permit the 
deletion of the letters from DUD' \ {b[,b\}. In the Lca case, the equivalent letter is in 
DUD' \ {b' v b{\ so is not present at the end. □ 

E.3 Proof of Lemma 7.4 

Let a,p G {A' + b[)+ andt/,vG (A + bi)*. 

Lemma E.5. If there is a derivation U\.a.u.g Di.p.v.g, then there exists some 
n and some words u\,v\,u 2 ,...,u n ,v v ,u n +\ such that, for all i < n, U[ G n\.(A' + 
b' l )+.(A + b l )+, Vi G a 2 .(D'\{b 1 })+.(C + b 2 ) + and 

(ui.g =>£ G Vi.g u i+ i.g, 

\ Di.a.u.g =>* Fr ui.g,u n+ i.g =>£ R Oi.fi.v.g. 

Proof. Let a, p G (A' + b\) + ,u, v G {A + b\)* and a derivation □ 1 .0C.w.,t> ni.p.v.g. 
By Lemma E.4, we know that there is a greedy derivation, say wq => w\ . ..w n , such 
that every w; is in Lac U Lca- We first note that if Wj.g G Lac an d Wi'+i-g G Lca, then 
w;+ 1 .g G □ 2 • (D' \ b\ ) + . (D \ 2> i ) + .g. 

Note also that if w t .g G □ 2 . (D' \ fe'j ) + . (D \ b i ) + .g and w i+ 1 .g ^ □ 2 . (D' \ ) + . (D \ 
then w i+1 .g G a 2 .(D' \ ^)+.(D \ ii)+.(A + fti).*. 

We will choose as v* all such G D2-(D' \b[) + .(D\bi) + and vv/ ; +i G D2-(D' \ 
b[ )+ . (D \ b\ )+ A Similarly for k,-, W*,. G □ i . (A' + ^ )+ . (A + 2>i )+ and w ki+ i G U 2 . (D' \ 
fti)+.(D\fti)+(A + fti). 

We now see that Vi,_/i < fc<- and < • This is directly implied by the fact that 
there is no way to go from D2.(D' \ b\) + . (D \ b\) + . (A + b\) to U 2 .{D' \ b\) + . (D \ 
in L C a- So there is a step in L^c and the first step is in Hi. (A' + b[) + .(A + b\) + . 

Next, we notice that since from a step □ 2 . (D' \ b \ ) + . (D \ Z? i ) + => □ 2 . (D' \ j ) + . (D \ 
fei)+. A (included) to a step □i.(A / + fe / 1 )+.(A + fe 1 )+ => Hi.(A' + b' l )+.(A + b 1 )+.(D\ 
b\) (excluded) there are only letters from □] +A' +A +b\ +b[ +g active and g only 
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inserts letters from A +b\. Those are rules from Fr, i.e., Vj.g =>p R Ui + \.g. Conversely 
iij.g ^>p c Vj.g by the same arguments. 

To conclude, we need to show that in fact v; G 02-(C + b' 2 ) + .(C + £>2) + - There are 
no rule in Fr deleting letters from BUB' \ {b\,b\} so if Vi.g ==>^ R iii + \.g, since there 
are no letter from BUB' \ {b\,b\} in Ui + \.g, then there were none in Vi.g. □ 

We can now prove Lemma 7.4: consider a derivation U\.a.u.g ^>* H Hi.fi.v.g. With 
Lemma E.5, we can find m,, v, such that m, e □ i.A' + A+, v, G □2.D' + .C+. Then for all 
i < 

and Qi .a.M.g ^ R u\.g,u n+ \.g =>£ R ni.p.v.g. 

Let us write m;,v; as m,- = Di.a.M^a G A'+,m'. g A+ and v,- = D2.p.v-,P G v- G C+. 

This means that UiSF G Vi and v,-5/r R Mi+i. Hence mJSgV; and vJSrmJ +1 using Lemma 7.3. 
With Sr = « . C .ft and .Sg. C c ~ Sg we have mJ(5g-/i)mJ +1 . So u[(SG-h) n u' n+l . 

Thus mi and m«+i are in Di A /+ A + and 

□i.a.M ^* Fr U\.a!.u\ and Di .a'.u' n+l =>* Fr v. 

Since R is a simple transformer, it has no rules a ---> b for a,b G A so there are only 
insertion on A between m and k'j and between u' n+l and v. Thus m C k'j and C v. 
This concludes the proof that u C k'j (5g-^)"m^ +1 C v. □ 

E.4 Proof of Lemma 7.5 

Let be a greedy derivation □[ .ti.g =>j^, D2- di P-v.g. 

First note that u is deleted before the insertion of a letter from D since every letter 
from A blocks D. 

There could be a derivation where a word from □ 1 A'* .A + .D.g is reached. But since 
□ 1 will be eventually deleted by B\, it is possible to find another greedy derivation 
where a Di is inserted which inserts B\ and delete before the insertion of d. 

The rules used before the first insertion of a letter of D define a renaming from A to 
A. So there is u' such that u C u'(Sc-h)*v. □ 

E.5 Finals steps of the construction: T\ and T2 

7i is defined as (A UA', {□i,2>i,2/ 1 }, {ti2,0i},A UA' U {b\ ,b\, Di}, 02,01 ,Pi,g) with 
the following set of rules: 

all /--»/', for Z € {Di.fei, fe'JUAUA', 

all g — > di, for dj G A, 

all di —7- a'/, for a,-, dj G A, 

all a,- — > b\ , for di G A, 

all ^'1 a', for a' eA'Ub\, 

alia' ^b', for a', 9 eA'Ub\, 
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all a' ->• □ i , for a> ei'U b\ , 
□ l — >-oi. 

01 — > ri 2 . 

We let the reader check that these rules ensure the satisfaction of (7\-spec). 

We further define T 2 = (A U A' U {b\ , b\ , □ i } , {01 , o 2 } , A , o\ , o 2 , Pi , g) with the fol- 
lowing set of rules: 

all g — > a\, for a\ e A , 
all a} — > a), for a], a) e A, 
all a} —7- 02, for a] £ A , 
all d\ ---> cii, for d\ G A, 

all a; --->/, for aj G A and / G A' U {b\, b\ , □ i }, 

02 --->oi. 

We let the reader check that these rules ensure the satisfaction of (T2-spec). 



